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MY EXPERIENCE WITH THE HILLEGAS SCALE 



CHARLES GUNTHER 
Principal School No. 2, Poughkeepsie, New York 



If the supervisor in English uses the Hillegas scale or some 
similar scale, he can supervise more intelligently the work of his 
teachers. No longer need he rely on vague opinions. He feels 
that he has his work well in hand. He knows with reasonable 
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accuracy the quality and progress of the theme work of the pupils 
under his supervision; he knows that it is possible to maintain in 
his school standards for promotion that shall be reasonably free 
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from mere "individual idiosyncrasy, not to mention temporary 
mood or caprice." 

The supervisor may collect sets of themes at the beginning 
of the term and grade them. In the middle of the term, before it 
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is too late, another set may be called for and graded. By com- 
paring the first set with the second, improvement or lack of improve- 
ment may be noted with reasonable accuracy. The amount of 
improvement by years can also be discovered. For example, in 
October, 191 7, I called for sets of themes. Among them was a 
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set from a second-year high-school English class and a set from a 
third-year high-school English class. After grading them by the 
Hillegas scale I had the results as indicated in Table I. 

The third-year class, as can be seen by the foregoing figures, 
showed a considerable advancement over the second-year class. The 
former, with twenty-five students, had fifteen themes with quality 
838, while the latter with more students, twenty-eight, had three. 
The median of English III was 838, while the median of English 
II was 675. So I feel confident that English III did better work. 

TABLE III 
Distribution Tables and Graphs of the Differences, with the Medians 
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That percentage marks ought not to be taken too seriously is 
shown by the following experiment. Two teachers from Pough- 
keepsie High School and I graded a set of thirty-five first-year 
English compositions last October. Two months later I asked the 
same teachers to grade the same papers (the first marks having been 
removed in the meantime). The result is given in Table II. 

These differences in per cent ratings are really greater than they 
seem. The range of marks is not 0-100 but 40-93 in the case of 
Miss A, 40-85 in the case of Miss B, and 45-85 in the case of 
Mr. Gunther. In case of Miss A the median difference in the two 
ratings was 5-9 per cent; in the case of Miss B, 10-14 per cent; 
in the case of Mr. Gunther, 5-9 per cent. If Miss A varies from 
5 to 9 per cent in a range of 53 points, she would vary about twice 
as much in a range of 0-100. So naturally would the others. 
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In the October, 19 17, trial none of the medians were the same; 
in the December, 19 17, trial none of the medians were the same. 
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TABLE V 
Per Cent Marks December, 1917 
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It can also be seen that in the case of Miss A twenty-four 
of her second marks are higher than her first marks; thirty of 
Miss B's second marks are higher than her first marks; in the case 
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of Mr. Gunther this is true in sixteen cases. What are the reasons 
for this? Here is a probable explanation: Miss B was in better 
physical condition the second time, as she herself stated. In the 
case of all three teachers the first reading was during the rush of 
school time; the second was during the Christmas vacation. This 
had its effect on the marks. The physical condition of the examiner 
certainly has its effect on the ratings given themes. 



Miss A 



MissB 



Mr. Gunther 



Fig. i. 




o-* 5-9 10-14 
—Graph of difference in the two marks 



Miss A is regarded by the students as an easier marker than the 
other teachers in her department. Her median, 80-84, proves that 
the students are right in their estimate. She is the only one who 
marked any of these thirty-five themes above 90 per cent. This 
partly explains why her classes are overcrowded when the students 
elect their English teacher at the beginning of the term. This 
variation in marking a principal must guard against. 

Several months afterward Mr. E. D. Hewes, supervising 
principal of Cobleskill High School, very kindly consented to 
help me grade these same thirty-five themes by means of the 
Hillegas scale. The result is given in Table VI. 
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In case of Mr. Gunther, seven marks were higher the second 
time, one was lower, and twenty-seven were the same. The median 
of the first rating was 474; of the second, 474. 

In the case of Mr. Hewes, two marks were higher, eight were 
lower, and twenty-five were the same. The median of his first 
marks was 474; of his second, 474. The two judges were able 
to agree on twelve themes in the four trials. It would seem 
then that the Hillegas scale produced the more even rating. 
Mr. Gunther has never met Mr. Hewes, has never worked in the 
same school system with him. On the other hand, Miss A, Miss B, 
and Mr. Gunther have worked together for five years, have met 
for conferences, have tried to adopt a percentage scale that would 
do away with as much variability in their ratings as possible. Yet 
the Gunther-Hewes ratings with the Hillegas scale were more 
uniform than the Gunther-A-B ratings with the percentage scale. 

Let us examine the ratings more carefully. Let us compare the 
grades assigned to theme 1 with the grades assigned to theme 14. 
Miss A marked theme 1, 93 per cent and theme 14, 60 per cent. 
Miss B marked theme 1, 80 per cent and theme 14, 48 per cent. 
With the Hillegas scale the two judges judged both to be equal 
in value. If we take themes 8 and 19, we find that with the Hillegas 
scale the two judges marked them each time equal in value, 474. 
But with the per cent scale Miss A marked theme 8, 86 per cent 
and 85 per cent, and theme 19, 73 per cent and 82 per cent. Miss B 
marked theme 8, 61 per cent and 62 per cent, and theme 19, 55 per 
cent and 84 per cent. Mr. Gunther marked theme 8, 80 per cent 
and 70 per cent, and theme 19, 75 per cent and 80 per cent. The 
variations in the marking with the use of the percentage scale in this 
case are marked. Here a principal would have something tangible 
to talk about in his private conference with his teacher. His 
supervision would be intelligent. Why did Miss B mark theme 8 
so low as 61 per cent? Why did Miss A mark theme 8 so high as 
86 per cent? Very likely as a result of such a conference these 
•two teachers would be less radical in their future markings. 

Going back to the second point of this study, we see that by 
examining a set of themes that have been marked by a teacher a 
principal can soon determine by means of the Hillegas scale whether 
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the teacher is a careful marker or not. The rating of themes is a 
matter which cannot be detected by the principal in his visit to the 
classroom; yet this is very important. A student ought to have 
just as much chance to pass with one teacher as with another. 
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Under the percentage system he apparently has not; under the Hille- 
gas system his chances would be better. With a passing mark of 
75 per cent the student who wrote theme 1 would get honors with 
Miss A while the one who wrote theme 14 would fail. Both themes 
were, in the judgment of the two examiners using the Hillegas scale, 
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equal in value. In October, 191 7, under Miss B the students 
writing themes 3, 4, 12, 17, 24, 29, and 34 would have failed, whereas 
they would have passed if they had had either of the other teachers. 
That is, 20 per cent of Miss B's class would have been held back 
one-half year at Poughkeepsie High School if the promotion 
examinations had been held in October, 191 7; whereas if they had 
had the other teachers they would have passed. 

In summing up, I find that with the Hillegas scale or some 
similar scale an experienced English teacher may arrive at the 
quality and progress of the theme work of students with reasonable 
accuracy, that with the Hillegas scale or some similar scale it will 
be possible to maintain more even standards for promotion which 
will be reasonably free from temporary moods or caprices, and 
lastly, that with the Hillegas scale the ratings given by a group of 
experienced English teachers to a set of pupils' papers will be more 
uniform than those given when the per cent scale is used. 



